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We investigate the competition dynamics of two microbial or viral strains that live in an environ- 
ment that switches periodically between two states. One of the strains is adapted to the long-term 
environment, but pays a short-term cost, while the other is adapted to the short-term environment 
and pays a cost in the long term. We explore the tradeoff between these alternative strategies in 
extensive numerical simulations, and present a simple analytic model that can predict the outcome 
of these competitions as a function of the mutation rate and the time scale of the environmental 
changes. Our model is relevant for arboviruses, which alternate between different host species on a 
regular basis. 



I. INTRODUCTION 

The quasispecies model 1] is the premier model to 
study the evolution of asexual replicators, such as self- 
rephcating molecules or viruses j^i, iSj . Originally formu- 
lated for constant environments, the quasispecies model 
has recently been extended to describe adaptation to a 
changing environment ^, ^, Ji, ^] . The various exten- 
sions of the model all work within the original determin- 
istic framework developed by Eigen and coworkers, and 
thus assume that the population size is infinite. This as- 
sumption implies that a population can never lose any 
genetic information. However, the loss of genetic mate- 
rial, and the mechanisms that prevent it from occurring, 
are probably major forces shaping the evolutionary dy- 
namics of finite populations in time-dependent environ- 
ments: A finite population can lose to mutation pres- 
sure previously useful genetic material that has become 
meaningless after a change in the environment, and the 
population may not be able to reacquire this material 
when the environment changes back to its original state. 
Consequently, the population will be at a selective dis- 
advantage in comparison to another population that has 
managed to prevent a similar loss, even if the second 
population had to pay some short-term cost to keep the 
useless genetic material. Therefore, in an environment 
that alternates between two (or more) states, natural se- 
lection faces two conflicting agendas — specialization to 
the current state of the environment, or adaptation to 
the long term environment which includes both environ- 
mental states. 

Here, we study the evolutionary dynamics of compet- 
ing finite populations of asexual replicators in an envi- 
ronment that alternates between two states, remaining in 
each state for a time interval of length T/2 before switch- 
ing to the other. To study the tradeoff between the com- 
peting forces of selection in this environment, we consider 
two different strains of replicators, as previously proposed 
0. The first strain, which we refer to as the fused strain, 
has a single gene that performs equally well in both en- 
vironmental states. The second strain, which we refer to 
as the divided strain, has two genes, each of which is ad- 
vantageous in one environmental state and useless in the 



other. Clearly, if the fused strain performs as well as the 
divided strain in both environments, without paying any 
additional cost, then the divided strain cannot have a se- 
lective advantage over the fused strain, regardless of the 
time scale on which environmental changes happen. If, 
however, the fused strain does pay some small cost, then 
which strain is advantageous depends on the exact inter- 
play of the cost, the time scale of environmental change, 
and the mutation rate. Here, we develop a method to 
assess and analyze this interplay, and to predict which 
strain is advantageous in a given setting. As cost, we con- 
sider the differential mutation pressure that arises when 
the fused and the divided strain have genes of different 
length 0[ ^1 . However, it is straightforward to extend 
our approach to other types of costs. 

Note that while we refer to separate genes throughout 
this paper, our model can also apply to separate functions 
carried out by a single gene. In this case, the divided 
strain corresponds to a gene that can adapt to either 
function, but not to both at the same time, whereas the 
fused strain corresponds to a gene that can adapt to both 
functions at the same time. Such a situation has been 
observed in an artificial-life simulation with a changing 
environment [T^ . where the fusion of genetic function 
evolved presumably through changes in the amount of 
epistatic interactions among the different parts of the or- 
ganisms' genomes. 

II. MATERIALS AND METHODS 
A. Model 

Here, we model the evolutionary dynamics of a finite 
population in a time-dependent environment. For com- 
parison, for an inifite population in a time-dependent en- 
vironment, the quasispecies equation reads [3 

j 3 

where yi is the fraction of type i in the population, Wi{t) 
is the replication rate (i.e., fitness) of type i at time t, and 



2 



Hij is the mutation rate per unit time from type j to type 
i. The quadratic term corresponds to the total produc- 
tion of new organisms per unit time, and is subtracted 
to keep the yi{t) normalized. 

We represent all genes as binary strings. The divided 
strain has two genes, each of length Ldiv, and the fused 
strain has a single gene of length Lfusc- For each gene, 
there exists a single functional sequence (the master se- 
quence) that confers the selective advantage, and all al- 
ternative sequences are non-functional, regardless of the 
environment. The reproductive fitness 'Wi{t) of an in- 
dividual is determined by whether the individual has a 
functional gene specialized for the current state of the 
environment. An individual with the correct functional 
gene has fitness 1 -I- s, while an individual without such 
gene has fitness 1. Mutations occur upon reproduction 
with a per-site probability /i, corresponding to a per-gene 
mutation rate oi U — fiL. 

B. Simulation 

Both the speed of environmental change and the 
mutation rate fi are important factors in determining 
the outcome of the competition between the divided 
and fused strains. To assess their relative importance, 
we simulated a population of = 1000 individuals 
reproducing in discrete generations, and with prob- 
ability of reproduction proportional to their fitness 
(Wright-Fisher samphng). Initially the population 
was divided equally between fully functional members 
of the two strains and the simulation continued un- 
til one strain became extinct. We fixed the length 
of the divided strain at Ldiv — 5, while Lfusc var- 
ied from 3 to 11. We performed 10,000 replicates 
at each pair of period lengths and mutation rates, 
for T/2 (10,30,100,300,1000,3000,10000) and t/div = 
/iLdiv (0.0001, 0.0003, 0.001, 0.003, 0.01, 0.03, 0.1, 0.3, 1, 3). 

III. RESULTS 

A. Time Scales 

To understand the dynamics of competition between 
the divided and fused strains in a finite population, we 
consider two time scales — the competition time scale Tc 
and the drift time scale T^r- In order to calculate these 
time scales, we need to know the selective advantage of 
one strain over the other. The selective advantage s of 
genotype i over genotype j is a key quantity in theoretical 
pop ulation genetics, and is defined as s = (wi — Wj)/wj 
[l3j- While this definition is a priori applicable only to 
individual genotypes, it turns out that to a good approx- 
imation standard results from population genetics can be 
applied to separate strains (which consist of a mixture of 
closely related mutants) if we treat each strain as an indi- 
vidual genotype with fitness given by the strain average 



{w) Throughout this paper, we refer to the selec- 

tive advantage of one strain over another as the effective 
selective advantage of this strain. We define the effec- 
tive fitness advantage of the fused strain over the divided 
strain by 

min{(wf 

use ), (w^div)}' 

This definition guarantees that the magnitude of SeS cor- 
responds to the fitness advantage of the superior strain, 
while the sign indicates whether the fused strain is supe- 
rior (positive Scff) or inferior (negative Scff). If one strain 
has an effective fitness advantage |soff I over the other, the 
competition time scale Tc is defined to be the typical time 
until extinction of the inferior strain in a constant envi- 
ronment. Neglecting finite population effects, and ap- 
plying Eq. Q to strains rather than genotypes, we find 
that the population fraction x{t) of the superior strain 
changes approximately according to the logistic equation 

x{t) = \s,s\xit)[l - x{t)l (3) 

subject to our initial condition that x{0) = 1/2. To de- 
termine the typical extinction time of the inferior strain, 
we solve Eq|31for the time when a single member remains 
of the inferior strain. Thus 

_ \n{N - 1) ^ IniV 

\SeB\ |Scff| 

The drift time scale is defined as the average time for a 
neutral mutation to go to fixation. Neutral drift becomes 
important when the fitness advantage between the com- 
peting strains is small compared to the fluctuations due 
to finite sampling effects, i.e. when Soff ^15j. For 

our initial conditions, diffusion theory 16] predicts that 

Tdr = 2iV In 2 « 1390 generations. (5) 

B. Quasispecies Effects 

Given sufficient time to reach equilibrium, each strain 
will adopt a quasispecies distribution consisting primar- 
ily of those members with a functioning gene adapted to 
the current environment, together with the deleterious 
mutants that are constantly regenerated through muta- 
tion pressure. In our dynamic fitness landscape, the fused 
strain will attain this equilibrium distribution after some 
time, while the divided strain will attempt to equilibrate 
to the current environment and then go through a period 
of transition when the environment changes. As an exam- 
ple. Figure n illustrates the formation of the fused-strain 
quasispecies and the dynamics of the divided-strain qua- 
sispecies in the limit of large population size. 

Before we can use the time scales derived in the pre- 
ceeding section to predict the competition's outcome, we 
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must estimate the average fitness of each, strain appear- 
ing in Eq|21 In our model, the average fitness {w) of a 
strain is given by (w) = 1 + sj/q, where yo is the frac- 
tion of the population that has a functional gene for the 
current environment. Thus Eq [21 becomes 

Scff = — —f r ■ (6) 

To estimate yo, we assume that the quasispecies immedi- 
ately reaches its equilibrium distribution. Denoting yi (t) 
as the population fraction of a given strain with i errors 
at time t, we neglect the back mutation term (that is, 
the term that represents mutations from genotypes with 
errors to error-free genotypes) in Eq. JQ) and obtain 

ya{t) = (1 + s)Qoo{L)ya{t) - [1 + syo{t)]yo{t), (7) 

where Qij{L) is the probability that a string of length 
L with j errors mutates into one with i errors. Qij{L) 
has been given for example in Ref. Setting yo = 

in equilibrium, we find yo = [(1 -|- s)Qoo{L) — l]/s. 
However, when back mutations become significant and yo 
approaches zero in this expression, we reach the classical 
error threshold jllj. For mutation rates beyond this point, 
we assume that the population is randomized uniformly 
over all possible states, and hence we use 

f (1 + .s)Qoo{L) - 1 
2/0 = max I ,2 (8) 

for all mutation rates. For mutation rates below the error 
threshold, Eq. ||SJ) yields a simple form for the magnitude 
of the fitness advantage, |scff | = (1 — ^^^l-^div-ituad _ 
while the sign of s^s is given by sgn(L(jiv — ifusc)- This 
result shows that the effect of mutational load on fitness 
is to favor whichever strain has the shorter length. 

The result of Eq (|HJ applies to a quasispecies that has 
reached equilibrium. While the equilibrium assumption 
provides a good estimate for the fused strain, the rate of 
environmental changes may prevent the divided strain's 
quasispecies from ever reaching this equilibrium. If the 
environment changes quickly (relative to the competition 
and drift time scales), the divided strain persists in an 
average environment that requires both genes for func- 
tionality 0, and hence this strain has an effective gene 
length of 2Ldiv In this case, we approximate the re- 
sulting quasispecies as one with a single gene of length 
2Ldiv and replace Qoo{Ldiv) by (5oo(2idiv) in Eq. 
Even though this approximation disregards the two di- 
mensional nature of the divided-strain quasispecies, it 
gives a reasonably good estimate of the true yo for most 
mutation rates. Note, however, that we do not replace 
2-Ldiv 2-2Ldiv^ j^^ g^j^y point in time, the environment 

favors only one of the divided strain's two genes, and 
hence beyond the error threshold the probability that a 
randomly-chosen individual from the divided strain car- 
ries a functional gene remains 2~^'^'^. We refer to ScS 
calculated with QooiLdiv) as the short-term limit, and to 
Sefi calculated with (5oo(2idiv) as the long-term limit. 



C. Predicting the probability of fixation 

We propose a simple ternary model to predict the prob- 
ability of fixation p of the fused strain. In our model, p 
is if the divided strain is favored, 1/2 for neutral evo- 
lution, when both strains are equally likely to go to fixa- 
tion, or 1 if the fused strain is favored. First, we classify 
the selective regime based on the drift time Tjr and the 
short term competitive time scale Tc = In Af/|s^g°''*| in 
comparison to T/2, the length of time for which the en- 
vironment remains constant (see Tabled). If Tc < T/2, 
then we expect the competition between the two strains 
to end before the environment changes even once, and 
hence the short-term limit applies. The value of Tjr is 
irrelevant in this case. If both times are longer than T/2, 
then we expect the competition to extend over several 
half-periods, and hence the long-term limit applies. Fi- 
nally, if T/2 is smaller than Tc, but larger than Tdr, then 
we expect drift to be the dominant force. We call this 
regime the neutral limit, and set SeS = 0. Having deter- 
mined the appropriate limiting case (short-term, neutral, 
or long-term limit) , we can use the associated fitness ad- 
vantage Soff to predict the probability p of fixation for 
the fused strain (Table EJ- If koffl < ^/N, then the two 
strains are effectively neutral. Hence, the outcome of the 
competition is determined by drift, and p = 1/2. Other- 
wise, p = or 1 depending on whether SoS is negative or 
positive. 



D. Comparison with simulation results 

From our simulations, we estimated the probability p 
that the fused strain would fixate in the population as a 
function of T/2, Udw, and Tfusc- Results are shown in 
Figure B, C, for representative values of Lfusc- The 
probabilities obtained by simulation have a standard er- 
ror of ±1%. Figure |2t), E, F shows the corresponding 
predictions of our model. Over the full range of param- 
eters described in Section rTTBl our model predicted that 
the divided strain was superior in 235 cases {p = 0), that 
the strains were neutral in 240 cases (p = 1/2), and that 
the fused strain was superior in 155 cases {p — 1). The 
range of simulation results corresponding to each of these 
three predictions is shown in Figure 13 When the model 
predicted p = 0, 85% of all simulation probabilities fell 
in the range — 0.1. When p — 1 was predicted, 84% 
of simulation probabilities fell between 0.9 — 1, while in 
the neutral case {p — 1/2 predicted), 58% of simulation 
probabilities fell between 0.45 — 0.55. The rms error be- 
tween the simulation values and the model predictions is 
18.6%, averaged over all cases. For comparison, the best 
possible rms error for any such ternary model is 7.6% on 
this data. 



4 



IV. DISCUSSION 

The study of quasispecies dynamics in a time depen- 
dent fitness landscape to date has primarily focused on 
the limit of infinite population size 
a periodic fitness landscape, an infinite population size 
guarantees that competition between two strains will re- 
sult in the deterministic extinction of the inferior strain 
or, in certain finely tuned cases, an unstable coexistence 
between the strains (although frequeiicy dependent se- 
lection may stabilize this equilibrium |l8l|). In contrast, 
the generalization of these models to a finite population 
presents a continuous range of possibilities from almost 
certain extinction to the complete randomness of neutral 
drift. 

In this study, we present a model for predicting the 
outcome of competition between finite quasispecies' in a 
periodic environment. As applied to our specific case of 
competition between a divided and a fused strain, our 
model shows good qualitative and quantitative results 
in comparison with simulation (Figs. [3 and When 
Lfuse < -^^div or Lfuse > 2L(jiv, One of the two strains 
is strictly superior, and the outcome of the competition 
is determined by whether drift or competition are more 
important at the given mutation rate. For intermedi- 
ate values of Lfuse the competitive dynamics are more 
complex, as certain combinations of mutation rate and 
period lengths favor the fused strain while others favor 
the divided strain. 

The only qualitative feature of the competitive dynam- 
ics we have ignored is the time for the quasispecies to 
reach its equilibrium distribution. We decided not to 
include a quasispecies time scale in our model after pre- 
liminary efforts showed that the added complexity failed 
to significantly improve model accuracy. While the fused 
strain quickly reaches its equilibrium independent of the 
environment's state, the divided strain may fare worse 
than predicted in some special cases when it spends sig- 
nificant time in transitions between the quasispecies dis- 
tributions for each environment, being poorly adapted in 
the meanwhile. Still, these cases are relatively rare since 
at low mutation rates quasispecies effects are less impor- 
tant, while at high mutation rates the error threshold 
masks any such effects. 

Our results suggest that, under appropriate circum- 
stances, a selective pressure exists to fuse or divide com- 
plementary genes in a periodic environment. The ten- 
dency of uncertain enviromental conditions to facilitate 
large scale genetic changes such as this one has recently 
been studied 0. Most genomes are full of apparently 



useless or non-functional genetic material, which in our 
model corresponds to the excess length of the fused strain 
over a single gene of the divided strain. In populations as 
small as 1000 such as those we studied, such temporar- 
ily useless genetic material comprising 20% of the genome 
could be stably maintained for periods of dormancy up to 
1000 generations (Figure|33). Alternatively, a fused gene 
exposed to an appropriate periodic environment might 
undergo division in response to the selective pressure we 
describe. The beginnings of a genome segmentation has 
been observed in foot-and-mouth disease virus (FMDV) 
in response to conditions of high multiplicity of infec- 
tion |20|. One explanation of this segmentation is that 
translational speed favors shorter genes |^ [S^l , a fitness 
effect similar to the length-based mutational pressure of 
our model. 

Our model applies directly to the evolution of ar- 
boviruses, which are viruses transmitted by arthropods. 
For example, the arbovirus West-Nile virus is transmit- 
ted from birds to birds (and the occasional human) by 
mosquitoes, experiencing the alternating environments 
of the avian and insect hosts. Experimental virologists 
have long tried to determine whether viruses subjected 
to such alternating environments adapt to the short-term 
or the long-term environment, but have not found a con- 
clusive answer. Experiments with vesicular stomatitis 
virus (VSV), eastern equine encephalitis virus (EEEV), 
and Dengue virus in cells of insect and mammalian ori- 
gin have shown that in some cases, adaptation to one cell 
type leads to loss of fitness in the other cell type, while in 
other cases fitness can increase in both cell types at the 
same time Which of the two 

cases occurs depends on the time spent in each of the two 
hosts, and also on the details of the fitness landscape in 
the two hosts. It stands to reason that in future exper- 
iments in which the time scale of environmental change 
is varied over a wide range, a switch in the adaptation 
strategy from short term to long term will be observed, 
and that the time scale at which the switch occurs can 
be predicted with the methods we have developed here. 

In conclusion, we have demonstrated that the compet- 
itive dynamics of finite populations in a time-dependent 
environment can be quite complex, but that neverthe- 
less, estimates of the effective fitness advantages of the 
different strains together with an understanding of the 
drift and competitive time scales can lead to remarkably 
accurate predictions of the evolutionary dynamics. We 
believe that similar techniques will prove useful to inter- 
pret and predict outcomes of virus-evolution experiments 
in changing environments. 
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TABLE I: Selective regime, as determined by the relative 
magnitudes of Tdr, To, and T/2. 



Condition 


Selective Regime 


r/2 > Tc 

Tdr < T/2 < Tc 
T/2<Td,,Te 


short term limit 

neutral limit 
long term limit 



TABLE II: Model predictions, as determined by the relative 
magnitude of Ses and 1/N. 



Fitness A<lvtuitagc (Fused Strain) 


Fiodictioii 


P 


Seff < -l/N 


divided strain wins 





-1/N < Seff < l/N 


neutral* 


1/2 


Scff > 1/iV 


fused strain wins 


1 



* In the neutral case, both strains are equally likely to win. 



7 



# mutations 
gene 1 




t=0 t=10 t=20 t=30 t=40 t=50 t=60 t=70 



to CD 

£ C 

O CD 

E to 



t=0 



t=10 



I 



t=20 



t=30 



t=40 



t=50 



I 



t=60 



t=70 



FIG. 1: Population structure of the divided and fused strains at various time points. Gray levels indicate the fraction of 
sequences at the given mutational distance from the respective error-free sequence. Parameters are: Oscillation period T = 60, 
per-site mutation rate ^ — 0.02, length of a single gene of the divided strain Ldiv = 5, length of the fused gene Lfusod = 8, 
selective advantage of functional gene s — 1, infinite population size. From 0. 
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FIG. 2: (Color online) Left (A, B, C): Simulation results for the probability of fixation of the fused strain as a function of the 
mutation rate Udiv ~ fJ^Ldiv and period length T for Ldiv = 5 and Lfusc ~ 4,6, 11 (top to bottom). Simulation results have a 
standard error of approximately ±1%. Right (D, E, F): Model predictions for the same. Parameter values are s = 1, A'^ = 1000. 
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FIG. 3: Fixation probabilities for the fused strain (as determined by simulation), classified by model prediction. The model 
predicted p = in 235 CEises, p = 1/2 in 240 cases, and p = 1 in 155 cases. The x ajcis is binned in 0.1 increments, with the 
bins' midpoint value shown. 



